Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 96211 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 20.5 MiB |
| Average record size in memory | 223.0 B |
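As a quick consistency check, the total memory figure can be recovered from the average record size and the number of observations. This is only a sketch of the arithmetic using the rounded values reported above; the variable names are illustrative.

```python
# Figures copied from the "Dataset statistics" table above.
n_observations = 96_211
avg_record_bytes = 223.0  # reported value, already rounded to one decimal

# Total footprint implied by the average record size.
total_bytes = n_observations * avg_record_bytes
total_mib = total_bytes / 2**20  # 1 MiB = 1,048,576 bytes

print(f"{total_mib:.1f} MiB")
```

The result agrees with the reported total once both figures are rounded to one decimal place.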
Variable types
| Categorical | 9 |
|---|---|
| DateTime | 1 |
| Numeric | 15 |
Alerts
| Alert | Type |
|---|---|
| order_status has constant value "delivered" | Constant |
| customer_id has a high cardinality: 96211 distinct values | High cardinality |
| customer_unique_id has a high cardinality: 93104 distinct values | High cardinality |
| customer_city has a high cardinality: 4083 distinct values | High cardinality |
| product_category_name_english has a high cardinality: 72 distinct values | High cardinality |
| order_purchase_time has a high cardinality: 602 distinct values | High cardinality |
| payment_value is highly overall correlated with sum_price and 2 other fields | High correlation |
| sum_price is highly overall correlated with payment_value and 1 other field | High correlation |
| sum_freight_value is highly overall correlated with payment_value | High correlation |
| product_weight_g is highly overall correlated with payment_value and 4 other fields | High correlation |
| product_length_cm is highly overall correlated with product_weight_g and 1 other field | High correlation |
| product_height_cm is highly overall correlated with product_weight_g | High correlation |
| product_width_cm is highly overall correlated with product_weight_g and 1 other field | High correlation |
| recence is highly overall correlated with recence_score | High correlation |
| recence_score is highly overall correlated with recence | High correlation |
| payment_type is highly imbalanced (57.9%) | Imbalance |
| customer_id is uniformly distributed | Uniform |
| customer_unique_id is uniformly distributed | Uniform |
| customer_id has unique values | Unique |
| length_comment_title has 85020 (88.4%) zeros | Zeros |
| length_comment_message has 57489 (59.8%) zeros | Zeros |
| product_description_lenght has 1359 (1.4%) zeros | Zeros |
| product_photos_qty has 1359 (1.4%) zeros | Zeros |
Reproduction
| Analysis started | 2023-02-13 12:50:00.887464 |
|---|---|
| Analysis finished | 2023-02-13 12:50:46.454252 |
| Duration | 45.57 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
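The reported duration is simply the difference between the two timestamps. A minimal sketch of that check, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table above.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-02-13 12:50:00.887464", FMT)
finished = datetime.strptime("2023-02-13 12:50:46.454252", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```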
customer_id
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 96211 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 9ef432eb6251297304e76186b10a928d | 1 |
|---|---|
| 29a8e7dc609b301eeb68e597e333f912 | 1 |
| d887148b2d2b9e3d51736103399c3227 | 1 |
| 8c15169cec84935673c0356c2f151da4 | 1 |
| 27e4e9e54add87b994001667ceb67802 | 1 |
| Other values (96206) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3078752 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 96211 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 9ef432eb6251297304e76186b10a928d |
|---|---|
| 2nd row | 31f31efcb333fcbad2b1371c8cf0fa84 |
| 3rd row | b0830fb4747a6c6d20dea0b8c802d7ef |
| 4th row | 41ce2a54c0b03bf3443c3d931a367089 |
| 5th row | f88197465ea7920adcdbec7375364d82 |
Common Values
| Value | Count | Frequency (%) |
| 9ef432eb6251297304e76186b10a928d | 1 | < 0.1% |
| 29a8e7dc609b301eeb68e597e333f912 | 1 | < 0.1% |
| d887148b2d2b9e3d51736103399c3227 | 1 | < 0.1% |
| 8c15169cec84935673c0356c2f151da4 | 1 | < 0.1% |
| 27e4e9e54add87b994001667ceb67802 | 1 | < 0.1% |
| b83c6d5f769b0e788a6bbd435c6036aa | 1 | < 0.1% |
| 3615ad4473507f4acd0c1511578b796d | 1 | < 0.1% |
| 27b22920b041f339fc2ee118c3597c8a | 1 | < 0.1% |
| f2466a19138a60af2319a3693c6b2b9e | 1 | < 0.1% |
| 1d1ab35efaaa5ca38a3b34695e63bf03 | 1 | < 0.1% |
| Other values (96201) | 96201 |
Words
| Value | Count | Frequency (%) |
| 9ef432eb6251297304e76186b10a928d | 1 | < 0.1% |
| 7711cf624183d843aafe81855097bc37 | 1 | < 0.1% |
| 41ce2a54c0b03bf3443c3d931a367089 | 1 | < 0.1% |
| f88197465ea7920adcdbec7375364d82 | 1 | < 0.1% |
| 8ab97904e6daea8866dbdbc4fb7aad2c | 1 | < 0.1% |
| 503740e9ca751ccdda7ba28e9ab8f608 | 1 | < 0.1% |
| 9bdf08b4b3b52b5526ff42d37d47f222 | 1 | < 0.1% |
| f54a9f0e6b351c431402b8461ea51999 | 1 | < 0.1% |
| 31ad1d1b63eb9962463f764d4e6e0c9d | 1 | < 0.1% |
| 494dded5b201313c64ed7f100595b95c | 1 | < 0.1% |
| Other values (96201) | 96201 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 192905 | 6.3% |
| c | 192777 | 6.3% |
| 5 | 192746 | 6.3% |
| f | 192673 | 6.3% |
| 1 | 192645 | 6.3% |
| 8 | 192634 | 6.3% |
| b | 192623 | 6.3% |
| 3 | 192599 | 6.3% |
| 7 | 192521 | 6.3% |
| 9 | 192382 | 6.2% |
| Other values (6) | 1152247 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1924091 | |
| Lowercase Letter | 1154661 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 192905 | |
| 5 | 192746 | |
| 1 | 192645 | |
| 8 | 192634 | |
| 3 | 192599 | |
| 7 | 192521 | |
| 9 | 192382 | |
| 6 | 192303 | |
| 0 | 191841 | |
| 4 | 191515 |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 192777 | |
| f | 192673 | |
| b | 192623 | |
| e | 192281 | |
| a | 192157 | |
| d | 192150 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1924091 | |
| Latin | 1154661 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 192905 | |
| 5 | 192746 | |
| 1 | 192645 | |
| 8 | 192634 | |
| 3 | 192599 | |
| 7 | 192521 | |
| 9 | 192382 | |
| 6 | 192303 | |
| 0 | 191841 | |
| 4 | 191515 |
Latin
| Value | Count | Frequency (%) |
| c | 192777 | |
| f | 192673 | |
| b | 192623 | |
| e | 192281 | |
| a | 192157 | |
| d | 192150 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3078752 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 192905 | 6.3% |
| c | 192777 | 6.3% |
| 5 | 192746 | 6.3% |
| f | 192673 | 6.3% |
| 1 | 192645 | 6.3% |
| 8 | 192634 | 6.3% |
| b | 192623 | 6.3% |
| 3 | 192599 | 6.3% |
| 7 | 192521 | 6.3% |
| 9 | 192382 | 6.2% |
| Other values (6) | 1152247 |
order_status
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| delivered |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 865899 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | delivered |
|---|---|
| 2nd row | delivered |
| 3rd row | delivered |
| 4th row | delivered |
| 5th row | delivered |
Common Values
| Value | Count | Frequency (%) |
| delivered | 96211 |
Words
| Value | Count | Frequency (%) |
| delivered | 96211 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 288633 | |
| d | 192422 | |
| l | 96211 | 11.1% |
| i | 96211 | 11.1% |
| v | 96211 | 11.1% |
| r | 96211 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 865899 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 288633 | |
| d | 192422 | |
| l | 96211 | 11.1% |
| i | 96211 | 11.1% |
| v | 96211 | 11.1% |
| r | 96211 | 11.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 865899 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 288633 | |
| d | 192422 | |
| l | 96211 | 11.1% |
| i | 96211 | 11.1% |
| v | 96211 | 11.1% |
| r | 96211 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 865899 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 288633 | |
| d | 192422 | |
| l | 96211 | 11.1% |
| i | 96211 | 11.1% |
| v | 96211 | 11.1% |
| r | 96211 | 11.1% |
| Distinct | 95689 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Minimum | 2017-01-05 11:56:06 |
|---|---|
| Maximum | 2018-08-29 15:00:37 |
review_score
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.1218468 |
| Minimum | -1 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 643 |
| Negative (%) | 0.7% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.3472712 |
|---|---|
| Coefficient of variation (CV) | 0.32686106 |
| Kurtosis | 1.5191145 |
| Mean | 4.1218468 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.5746366 |
| Sum | 396567 |
| Variance | 1.8151397 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 56606 | 58.8% |
| 4 | 18837 | 19.6% |
| 1 | 9314 | 9.7% |
| 3 | 7896 | 8.2% |
| 2 | 2915 | 3.0% |
| -1 | 643 | 0.7% |
| Value | Count | Frequency (%) |
| -1 | 643 | 0.7% |
| 1 | 9314 | 9.7% |
| 2 | 2915 | 3.0% |
| 3 | 7896 | 8.2% |
| 4 | 18837 | 19.6% |
| 5 | 56606 | 58.8% |
| Value | Count | Frequency (%) |
| 5 | 56606 | 58.8% |
| 4 | 18837 | 19.6% |
| 3 | 7896 | 8.2% |
| 2 | 2915 | 3.0% |
| 1 | 9314 | 9.7% |
| -1 | 643 | 0.7% |
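Because review_score takes only six distinct values, its descriptive statistics can be recomputed directly from the frequency table. The sketch below does this with the counts listed above; using the sample variance (n - 1 in the denominator) is an assumption, chosen because it is the convention that reproduces the reported figures.

```python
import math

# Value counts copied from the review_score frequency table.
counts = {5: 56606, 4: 18837, 3: 7896, 2: 2915, 1: 9314, -1: 643}

n = sum(counts.values())                        # number of observations
total = sum(v * c for v, c in counts.items())   # "Sum" row
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(n, total, round(mean, 4), round(variance, 4), round(std, 4), round(cv, 4))
```

Note that the 643 rows with a score of -1 are included in these statistics, so they pull the mean slightly below what the 1 to 5 scores alone would give.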
length_comment_title
Real number (ℝ)
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3571629 |
| Minimum | 0 |
|---|---|
| Maximum | 26 |
| Zeros | 85020 |
| Zeros (%) | 88.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 12 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.3093382 |
|---|---|
| Coefficient of variation (CV) | 3.175255 |
| Kurtosis | 11.900615 |
| Mean | 1.3571629 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.4823309 |
| Sum | 130574 |
| Variance | 18.570396 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 85020 | 88.4% |
| 9 | 1989 | 2.1% |
| 5 | 1111 | 1.2% |
| 15 | 860 | 0.9% |
| 3 | 696 | 0.7% |
| 10 | 559 | 0.6% |
| 17 | 467 | 0.5% |
| 13 | 466 | 0.5% |
| 25 | 404 | 0.4% |
| 14 | 386 | 0.4% |
| Other values (17) | 4253 | 4.4% |
| Value | Count | Frequency (%) |
| 0 | 85020 | 88.4% |
| 1 | 160 | 0.2% |
| 2 | 253 | 0.3% |
| 3 | 696 | 0.7% |
| 4 | 161 | 0.2% |
| 5 | 1111 | 1.2% |
| 6 | 236 | 0.2% |
| 7 | 362 | 0.4% |
| 8 | 325 | 0.3% |
| 9 | 1989 | 2.1% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 25 | 404 | |
| 24 | 206 | |
| 23 | 206 | |
| 22 | 192 | |
| 21 | 226 | |
| 20 | 335 | |
| 19 | 255 | |
| 18 | 291 | |
| 17 | 467 |
length_comment_message
Real number (ℝ)
| Distinct | 209 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.006195 |
| Minimum | 0 |
|---|---|
| Maximum | 208 |
| Zeros | 57489 |
| Zeros (%) | 59.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 39 |
| 95-th percentile | 141 |
| Maximum | 208 |
| Range | 208 |
| Interquartile range (IQR) | 39 |
Descriptive statistics
| Standard deviation | 47.125174 |
|---|---|
| Coefficient of variation (CV) | 1.7449764 |
| Kurtosis | 3.6781595 |
| Mean | 27.006195 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.0530015 |
| Sum | 2598293 |
| Variance | 2220.782 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 57489 | 59.8% |
| 9 | 991 | 1.0% |
| 5 | 550 | 0.6% |
| 200 | 531 | 0.6% |
| 3 | 510 | 0.5% |
| 26 | 478 | 0.5% |
| 10 | 456 | 0.5% |
| 34 | 450 | 0.5% |
| 20 | 438 | 0.5% |
| 31 | 433 | 0.5% |
| Other values (199) | 33885 |
| Value | Count | Frequency (%) |
| 0 | 57489 | 59.8% |
| 1 | 96 | 0.1% |
| 2 | 195 | 0.2% |
| 3 | 510 | 0.5% |
| 4 | 97 | 0.1% |
| 5 | 550 | 0.6% |
| 6 | 205 | 0.2% |
| 7 | 227 | 0.2% |
| 8 | 236 | 0.2% |
| 9 | 991 | 1.0% |
| Value | Count | Frequency (%) |
| 208 | 1 | < 0.1% |
| 207 | 1 | < 0.1% |
| 206 | 1 | < 0.1% |
| 205 | 1 | < 0.1% |
| 204 | 12 | < 0.1% |
| 203 | 14 | < 0.1% |
| 202 | 10 | < 0.1% |
| 201 | 19 | < 0.1% |
| 200 | 531 | |
| 199 | 300 |
payment_type
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| credit_card | |
|---|---|
| boleto | |
| credit_card,voucher | 2176 |
| voucher | 1494 |
| debit_card | 1482 |
Length
| Max length | 22 |
|---|---|
| Median length | 11 |
| Mean length | 10.108844 |
| Min length | 6 |
Characters and Unicode
| Total characters | 972582 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | credit_card,voucher |
|---|---|
| 2nd row | credit_card |
| 3rd row | boleto |
| 4th row | credit_card |
| 5th row | credit_card |
Common Values
| Value | Count | Frequency (%) |
| credit_card | 71918 | 74.8% |
| boleto | 19140 | 19.9% |
| credit_card,voucher | 2176 | 2.3% |
| voucher | 1494 | 1.6% |
| debit_card | 1482 | 1.5% |
| credit_card,debit_card | 1 | < 0.1% |
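The 57.9% imbalance reported for payment_type can be reproduced from these counts if the score is taken to be one minus the Shannon entropy of the class distribution divided by its maximum possible value. That formula is an assumption about how the profiler defines imbalance, not something stated in this report; the sketch below only shows that it is consistent with the number in the alert.

```python
import math

# Counts copied from the payment_type "Common Values" table.
counts = [71918, 19140, 2176, 1494, 1482, 1]

n = sum(counts)
probs = [c / n for c in counts]

# Shannon entropy in bits, and the maximum entropy for this many classes.
entropy_bits = -sum(p * math.log2(p) for p in probs)
max_entropy_bits = math.log2(len(counts))

# Assumed definition: 0 means perfectly balanced, 1 means a single class.
imbalance = 1 - entropy_bits / max_entropy_bits
print(f"{imbalance:.1%}")
```

Under this definition, a column split evenly across its six categories would score 0%, and the dominance of credit_card is what drives the score up.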
Words
| Value | Count | Frequency (%) |
| credit_card | 71918 | 74.8% |
| boleto | 19140 | 19.9% |
| credit_card,voucher | 2176 | 2.3% |
| voucher | 1494 | 1.6% |
| debit_card | 1482 | 1.5% |
| credit_card,debit_card | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 153343 | |
| r | 153343 | |
| d | 151156 | |
| e | 98388 | |
| t | 94718 | |
| i | 75578 | |
| _ | 75578 | |
| a | 75578 | |
| o | 41950 | 4.3% |
| b | 20623 | 2.1% |
| Other values (5) | 32327 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 894827 | |
| Connector Punctuation | 75578 | 7.8% |
| Other Punctuation | 2177 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 153343 | |
| r | 153343 | |
| d | 151156 | |
| e | 98388 | |
| t | 94718 | |
| i | 75578 | |
| a | 75578 | |
| o | 41950 | 4.7% |
| b | 20623 | 2.3% |
| l | 19140 | 2.1% |
| Other values (3) | 11010 | 1.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 75578 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2177 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 894827 | |
| Common | 77755 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 153343 | |
| r | 153343 | |
| d | 151156 | |
| e | 98388 | |
| t | 94718 | |
| i | 75578 | |
| a | 75578 | |
| o | 41950 | 4.7% |
| b | 20623 | 2.3% |
| l | 19140 | 2.1% |
| Other values (3) | 11010 | 1.2% |
Common
| Value | Count | Frequency (%) |
| _ | 75578 | |
| , | 2177 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 972582 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 153343 | |
| r | 153343 | |
| d | 151156 | |
| e | 98388 | |
| t | 94718 | |
| i | 75578 | |
| _ | 75578 | |
| a | 75578 | |
| o | 41950 | 4.3% |
| b | 20623 | 2.1% |
| Other values (5) | 32327 | 3.3% |
payment_installments
Real number (ℝ)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.926048 |
| Minimum | 0 |
|---|---|
| Maximum | 24 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.7113286 |
|---|---|
| Coefficient of variation (CV) | 0.92661796 |
| Kurtosis | 2.3968787 |
| Mean | 2.926048 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.6057481 |
| Sum | 281518 |
| Variance | 7.3513027 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 46709 | 48.5% |
| 2 | 12001 | 12.5% |
| 3 | 10099 | 10.5% |
| 4 | 6842 | 7.1% |
| 10 | 5103 | 5.3% |
| 5 | 5067 | 5.3% |
| 8 | 4117 | 4.3% |
| 6 | 3777 | 3.9% |
| 7 | 1550 | 1.6% |
| 9 | 615 | 0.6% |
| Other values (14) | 331 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 46709 | 48.5% |
| 2 | 12001 | 12.5% |
| 3 | 10099 | 10.5% |
| 4 | 6842 | 7.1% |
| 5 | 5067 | 5.3% |
| 6 | 3777 | 3.9% |
| 7 | 1550 | 1.6% |
| 8 | 4117 | 4.3% |
| 9 | 615 | 0.6% |
| Value | Count | Frequency (%) |
| 24 | 18 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 21 | 3 | < 0.1% |
| 20 | 16 | < 0.1% |
| 18 | 27 | < 0.1% |
| 17 | 7 | < 0.1% |
| 16 | 5 | < 0.1% |
| 15 | 72 | |
| 14 | 14 | < 0.1% |
payment_value
Real number (ℝ)
| Distinct | 27383 |
|---|---|
| Distinct (%) | 28.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 159.81411 |
| Minimum | 9.59 |
|---|---|
| Maximum | 13664.08 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 9.59 |
|---|---|
| 5-th percentile | 32.38 |
| Q1 | 61.88 |
| median | 105.28 |
| Q3 | 176.26 |
| 95-th percentile | 445.645 |
| Maximum | 13664.08 |
| Range | 13654.49 |
| Interquartile range (IQR) | 114.38 |
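The Range and IQR rows are derived from the other quantile rows. A small sketch of that derivation for payment_value, using the values in the table above:

```python
# Quantile values copied from the payment_value table.
q = {"min": 9.59, "q1": 61.88, "q3": 176.26, "max": 13664.08}

value_range = q["max"] - q["min"]   # spread between the extremes
iqr = q["q3"] - q["q1"]             # spread of the middle 50% of orders

print(round(value_range, 2), round(iqr, 2))
```

The gap between the two figures (a range over a hundred times the IQR) reflects the long right tail that the skewness and kurtosis values also indicate.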
Descriptive statistics
| Standard deviation | 218.88163 |
|---|---|
| Coefficient of variation (CV) | 1.3696014 |
| Kurtosis | 249.57272 |
| Mean | 159.81411 |
| Median Absolute Deviation (MAD) | 51.52 |
| Skewness | 9.3788853 |
| Sum | 15375875 |
| Variance | 47909.17 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 77.57 | 250 | 0.3% |
| 35 | 164 | 0.2% |
| 73.34 | 161 | 0.2% |
| 116.94 | 131 | 0.1% |
| 56.78 | 118 | 0.1% |
| 107.78 | 118 | 0.1% |
| 65 | 112 | 0.1% |
| 86.15 | 106 | 0.1% |
| 99.9 | 105 | 0.1% |
| 67.5 | 104 | 0.1% |
| Other values (27373) | 94842 |
| Value | Count | Frequency (%) |
| 9.59 | 1 | |
| 10.07 | 1 | |
| 10.89 | 1 | |
| 11.56 | 1 | |
| 11.62 | 1 | |
| 11.63 | 2 | |
| 12.28 | 1 | |
| 12.39 | 1 | |
| 12.89 | 2 | |
| 13.17 | 1 |
| Value | Count | Frequency (%) |
| 13664.08 | 1 | |
| 7274.88 | 1 | |
| 6929.31 | 1 | |
| 6922.21 | 1 | |
| 6726.66 | 1 | |
| 6081.54 | 1 | |
| 4950.34 | 1 | |
| 4764.34 | 1 | |
| 4681.78 | 1 | |
| 4513.32 | 1 |
nb_items
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1420732 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.53844026 |
|---|---|
| Coefficient of variation (CV) | 0.47145864 |
| Kurtosis | 116.76065 |
| Mean | 1.1420732 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.5698165 |
| Sum | 109880 |
| Variance | 0.28991791 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 86606 | 90.0% |
| 2 | 7372 | 7.7% |
| 3 | 1301 | 1.4% |
| 4 | 493 | 0.5% |
| 5 | 192 | 0.2% |
| 6 | 189 | 0.2% |
| 7 | 22 | < 0.1% |
| 10 | 8 | < 0.1% |
| 8 | 8 | < 0.1% |
| 12 | 5 | < 0.1% |
| Other values (7) | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 86606 | 90.0% |
| 2 | 7372 | 7.7% |
| 3 | 1301 | 1.4% |
| 4 | 493 | 0.5% |
| 5 | 192 | 0.2% |
| 6 | 189 | 0.2% |
| 7 | 22 | < 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 5 | |
| 11 | 4 | |
| 10 | 8 | |
| 9 | 3 | < 0.1% |
| 8 | 8 |
sum_price
Real number (ℝ)
| Distinct | 7626 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 137.00125 |
| Minimum | 0.85 |
|---|---|
| Maximum | 13440 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0.85 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 45.9 |
| median | 86.5 |
| Q3 | 149.9 |
| 95-th percentile | 399 |
| Maximum | 13440 |
| Range | 13439.15 |
| Interquartile range (IQR) | 104 |
Descriptive statistics
| Standard deviation | 209.11337 |
|---|---|
| Coefficient of variation (CV) | 1.5263611 |
| Kurtosis | 277.40578 |
| Mean | 137.00125 |
| Median Absolute Deviation (MAD) | 47.5 |
| Skewness | 9.8980267 |
| Sum | 13181027 |
| Variance | 43728.402 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 59.9 | 1678 | 1.7% |
| 69.9 | 1569 | 1.6% |
| 49.9 | 1388 | 1.4% |
| 89.9 | 1214 | 1.3% |
| 99.9 | 1160 | 1.2% |
| 79.9 | 982 | 1.0% |
| 39.9 | 952 | 1.0% |
| 29.9 | 943 | 1.0% |
| 19.9 | 900 | 0.9% |
| 29.99 | 856 | 0.9% |
| Other values (7616) | 84569 |
| Value | Count | Frequency (%) |
| 0.85 | 2 | |
| 2.2 | 1 | < 0.1% |
| 2.29 | 1 | < 0.1% |
| 2.9 | 1 | < 0.1% |
| 2.99 | 1 | < 0.1% |
| 3 | 2 | |
| 3.49 | 1 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| 3.54 | 1 | < 0.1% |
| 3.85 | 3 |
| Value | Count | Frequency (%) |
| 13440 | 1 | |
| 7160 | 1 | |
| 6735 | 1 | |
| 6729 | 1 | |
| 6499 | 1 | |
| 5934.6 | 1 | |
| 4799 | 1 | |
| 4690 | 1 | |
| 4590 | 1 | |
| 4400 | 1 |
sum_freight_value
Real number (ℝ)
| Distinct | 7864 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.784223 |
| Minimum | 0 |
|---|---|
| Maximum | 1794.96 |
| Zeros | 336 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.87 |
| Q1 | 13.85 |
| median | 17.17 |
| Q3 | 24.01 |
| 95-th percentile | 54.765 |
| Maximum | 1794.96 |
| Range | 1794.96 |
| Interquartile range (IQR) | 10.16 |
Descriptive statistics
| Standard deviation | 21.56532 |
|---|---|
| Coefficient of variation (CV) | 0.94650232 |
| Kurtosis | 586.93446 |
| Mean | 22.784223 |
| Median Absolute Deviation (MAD) | 4.38 |
| Skewness | 12.29266 |
| Sum | 2192092.9 |
| Variance | 465.06303 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15.1 | 2897 | 3.0% |
| 7.78 | 1802 | 1.9% |
| 14.1 | 1488 | 1.5% |
| 11.85 | 1423 | 1.5% |
| 18.23 | 1200 | 1.2% |
| 7.39 | 1125 | 1.2% |
| 15.23 | 809 | 0.8% |
| 16.11 | 780 | 0.8% |
| 8.72 | 738 | 0.8% |
| 16.79 | 686 | 0.7% |
| Other values (7854) | 83263 |
| Value | Count | Frequency (%) |
| 0 | 336 | 0.3% |
| 5.7 | 1 | < 0.1% |
| 5.82 | 1 | < 0.1% |
| 5.88 | 2 | < 0.1% |
| 6.52 | 1 | < 0.1% |
| 6.53 | 2 | < 0.1% |
| 6.56 | 1 | < 0.1% |
| 6.57 | 5 | < 0.1% |
| 6.78 | 5 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1794.96 | 1 | |
| 1002.29 | 1 | |
| 711.33 | 1 | |
| 626.64 | 1 | |
| 502.98 | 1 | |
| 497.42 | 1 | |
| 497.08 | 1 | |
| 479.28 | 1 | |
| 458.73 | 1 | |
| 456.47 | 1 |
customer_unique_id
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 93104 |
|---|---|
| Distinct (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 15 |
|---|---|
| 3e43e6105506432c953e165fb2acf44c | 9 |
| ca77025e7201e3b30c44b472ff346268 | 7 |
| 6469f99c1f9dfae7733b25662e7f1782 | 7 |
| 1b6c7548a2a1f9037c1fd3ddfed95f33 | 7 |
| Other values (93099) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3078752 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 90315 |
|---|---|
| Unique (%) | 93.9% |
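The "Distinct" and "Unique" rows measure different things: distinct counts how many different values occur, while unique counts how many values occur exactly once. The sketch below illustrates the difference on a made-up toy column (the IDs are hypothetical) and then applies it to the two figures reported for customer_unique_id.

```python
from collections import Counter

# Hypothetical toy column; these IDs are invented for illustration.
ids = ["a1", "b2", "a1", "c3", "d4", "b2", "a1"]
freq = Counter(ids)

n_distinct = len(freq)                              # different values seen
n_unique = sum(1 for c in freq.values() if c == 1)  # values seen exactly once
print(n_distinct, n_unique)

# Applied to the reported figures: distinct customers minus one-time
# customers gives the number of customers with more than one order.
repeat_customers = 93104 - 90315
print(repeat_customers)
```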
Sample
| 1st row | 7c396fd4830fd04220f754e42b4e5bff |
|---|---|
| 2nd row | 7c396fd4830fd04220f754e42b4e5bff |
| 3rd row | af07308b275d755c9edb36a90c618231 |
| 4th row | 3a653a41f6f9fc3d2a113cf8398680e8 |
| 5th row | 7c142cf63193a1473d2e66489a9ae977 |
Common Values
| Value | Count | Frequency (%) |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 15 | < 0.1% |
| 3e43e6105506432c953e165fb2acf44c | 9 | < 0.1% |
| ca77025e7201e3b30c44b472ff346268 | 7 | < 0.1% |
| 6469f99c1f9dfae7733b25662e7f1782 | 7 | < 0.1% |
| 1b6c7548a2a1f9037c1fd3ddfed95f33 | 7 | < 0.1% |
| 63cfc61cee11cbe306bff5857d00bfe4 | 6 | < 0.1% |
| 12f5d6e1cbf93dafd9dcc19095df0b3d | 6 | < 0.1% |
| dc813062e0fc23409cd255f7f53c7074 | 6 | < 0.1% |
| 47c1a3033b8b77b3ab6e109eb4d5fdf3 | 6 | < 0.1% |
| f0e310a6839dce9de1638e0fe5ab282a | 6 | < 0.1% |
| Other values (93094) | 96136 |
Words
| Value | Count | Frequency (%) |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 15 | < 0.1% |
| 3e43e6105506432c953e165fb2acf44c | 9 | < 0.1% |
| ca77025e7201e3b30c44b472ff346268 | 7 | < 0.1% |
| 6469f99c1f9dfae7733b25662e7f1782 | 7 | < 0.1% |
| 1b6c7548a2a1f9037c1fd3ddfed95f33 | 7 | < 0.1% |
| 63cfc61cee11cbe306bff5857d00bfe4 | 6 | < 0.1% |
| 12f5d6e1cbf93dafd9dcc19095df0b3d | 6 | < 0.1% |
| dc813062e0fc23409cd255f7f53c7074 | 6 | < 0.1% |
| 47c1a3033b8b77b3ab6e109eb4d5fdf3 | 6 | < 0.1% |
| f0e310a6839dce9de1638e0fe5ab282a | 6 | < 0.1% |
| Other values (93094) | 96136 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 192956 | 6.3% |
| 8 | 192846 | 6.3% |
| 1 | 192751 | 6.3% |
| 5 | 192682 | 6.3% |
| d | 192623 | 6.3% |
| a | 192599 | 6.3% |
| e | 192570 | 6.3% |
| 0 | 192538 | 6.3% |
| 9 | 192509 | 6.3% |
| 2 | 192459 | 6.3% |
| Other values (6) | 1152219 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1924536 | |
| Lowercase Letter | 1154216 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 192956 | |
| 8 | 192846 | |
| 1 | 192751 | |
| 5 | 192682 | |
| 0 | 192538 | |
| 9 | 192509 | |
| 2 | 192459 | |
| 3 | 192054 | |
| 4 | 191952 | |
| 7 | 191789 |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 192623 | |
| a | 192599 | |
| e | 192570 | |
| b | 192444 | |
| f | 192228 | |
| c | 191752 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1924536 | |
| Latin | 1154216 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 192956 | |
| 8 | 192846 | |
| 1 | 192751 | |
| 5 | 192682 | |
| 0 | 192538 | |
| 9 | 192509 | |
| 2 | 192459 | |
| 3 | 192054 | |
| 4 | 191952 | |
| 7 | 191789 |
Latin
| Value | Count | Frequency (%) |
| d | 192623 | |
| a | 192599 | |
| e | 192570 | |
| b | 192444 | |
| f | 192228 | |
| c | 191752 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3078752 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 192956 | 6.3% |
| 8 | 192846 | 6.3% |
| 1 | 192751 | 6.3% |
| 5 | 192682 | 6.3% |
| d | 192623 | 6.3% |
| a | 192599 | 6.3% |
| e | 192570 | 6.3% |
| 0 | 192538 | 6.3% |
| 9 | 192509 | 6.3% |
| 2 | 192459 | 6.3% |
| Other values (6) | 1152219 |
customer_city
Categorical
| Distinct | 4083 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| sao paulo | |
|---|---|
| rio de janeiro | 6574 |
| belo horizonte | 2687 |
| brasilia | 2065 |
| curitiba | 1483 |
| Other values (4078) |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 10.342622 |
| Min length | 3 |
Characters and Unicode
| Total characters | 995074 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1140 |
|---|---|
| Unique (%) | 1.2% |
Sample
| 1st row | sao paulo |
|---|---|
| 2nd row | sao paulo |
| 3rd row | barreiras |
| 4th row | vianopolis |
| 5th row | sao goncalo do amarante |
Common Values
| Value | Count | Frequency (%) |
| sao paulo | 15014 | 15.6% |
| rio de janeiro | 6574 | 6.8% |
| belo horizonte | 2687 | 2.8% |
| brasilia | 2065 | 2.1% |
| curitiba | 1483 | 1.5% |
| campinas | 1400 | 1.5% |
| porto alegre | 1340 | 1.4% |
| salvador | 1188 | 1.2% |
| guarulhos | 1143 | 1.2% |
| sao bernardo do campo | 908 | 0.9% |
| Other values (4073) | 62409 |
Words
| Value | Count | Frequency (%) |
| sao | 20326 | 12.1% |
| paulo | 15079 | 8.9% |
| de | 9280 | 5.5% |
| rio | 7935 | 4.7% |
| janeiro | 6574 | 3.9% |
| do | 4159 | 2.5% |
| belo | 2746 | 1.6% |
| horizonte | 2711 | 1.6% |
| brasilia | 2074 | 1.2% |
| porto | 1598 | 0.9% |
| Other values (3262) | 96056 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 164130 | |
| o | 122347 | |
| i | 76177 | 7.7% |
| r | 73931 | 7.4% |
| (space) | 72327 | 7.3% |
| e | 64744 | 6.5% |
| s | 60924 | 6.1% |
| n | 44229 | 4.4% |
| u | 43465 | 4.4% |
| l | 43382 | 4.4% |
| Other values (21) | 229418 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 922296 | |
| Space Separator | 72327 | 7.3% |
| Dash Punctuation | 227 | < 0.1% |
| Other Punctuation | 222 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 164130 | |
| o | 122347 | |
| i | 76177 | 8.3% |
| r | 73931 | 8.0% |
| e | 64744 | 7.0% |
| s | 60924 | 6.6% |
| n | 44229 | 4.8% |
| u | 43465 | 4.7% |
| l | 43382 | 4.7% |
| p | 35983 | 3.9% |
| Other values (16) | 192984 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 4 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 72327 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 227 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 222 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 922296 | |
| Common | 72778 | 7.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 164130 | |
| o | 122347 | |
| i | 76177 | 8.3% |
| r | 73931 | 8.0% |
| e | 64744 | 7.0% |
| s | 60924 | 6.6% |
| n | 44229 | 4.8% |
| u | 43465 | 4.7% |
| l | 43382 | 4.7% |
| p | 35983 | 3.9% |
| Other values (16) | 192984 |
Common
| Value | Count | Frequency (%) |
| (space) | 72327 | |
| - | 227 | 0.3% |
| ' | 222 | 0.3% |
| 1 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 995074 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 164130 | |
| o | 122347 | |
| i | 76177 | 7.7% |
| r | 73931 | 7.4% |
| (space) | 72327 | 7.3% |
| e | 64744 | 6.5% |
| s | 60924 | 6.1% |
| n | 44229 | 4.4% |
| u | 43465 | 4.4% |
| l | 43382 | 4.4% |
| Other values (21) | 229418 |
customer_state
Categorical
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| SP | |
|---|---|
| RJ | |
| MG | |
| RS | |
| PR | |
| Other values (22) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 192422 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP |
|---|---|
| 2nd row | SP |
| 3rd row | BA |
| 4th row | GO |
| 5th row | RN |
Common Values
| Value | Count | Frequency (%) |
| SP | 40406 | 42.0% |
| RJ | 12310 | 12.8% |
| MG | 11319 | 11.8% |
| RS | 5328 | 5.5% |
| PR | 4903 | 5.1% |
| SC | 3537 | 3.7% |
| BA | 3253 | 3.4% |
| DF | 2074 | 2.2% |
| ES | 1992 | 2.1% |
| GO | 1950 | 2.0% |
| Other values (17) | 9139 | 9.5% |
Words
| Value | Count | Frequency (%) |
| sp | 40406 | 42.0% |
| rj | 12310 | 12.8% |
| mg | 11319 | 11.8% |
| rs | 5328 | 5.5% |
| pr | 4903 | 5.1% |
| sc | 3537 | 3.7% |
| ba | 3253 | 3.4% |
| df | 2074 | 2.2% |
| es | 1992 | 2.1% |
| go | 1950 | 2.0% |
| Other values (17) | 9139 | 9.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 52296 | 27.2% |
| P | 48896 | 25.4% |
| R | 23334 | 12.1% |
| M | 13763 | 7.2% |
| G | 13269 | 6.9% |
| J | 12310 | 6.4% |
| A | 5596 | 2.9% |
| E | 5184 | 2.7% |
| C | 4890 | 2.5% |
| B | 3769 | 2.0% |
| Other values (7) | 9115 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 192422 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 52296 | 27.2% |
| P | 48896 | 25.4% |
| R | 23334 | 12.1% |
| M | 13763 | 7.2% |
| G | 13269 | 6.9% |
| J | 12310 | 6.4% |
| A | 5596 | 2.9% |
| E | 5184 | 2.7% |
| C | 4890 | 2.5% |
| B | 3769 | 2.0% |
| Other values (7) | 9115 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 192422 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 52296 | 27.2% |
| P | 48896 | 25.4% |
| R | 23334 | 12.1% |
| M | 13763 | 7.2% |
| G | 13269 | 6.9% |
| J | 12310 | 6.4% |
| A | 5596 | 2.9% |
| E | 5184 | 2.7% |
| C | 4890 | 2.5% |
| B | 3769 | 2.0% |
| Other values (7) | 9115 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 192422 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 52296 | 27.2% |
| P | 48896 | 25.4% |
| R | 23334 | 12.1% |
| M | 13763 | 7.2% |
| G | 13269 | 6.9% |
| J | 12310 | 6.4% |
| A | 5596 | 2.9% |
| E | 5184 | 2.7% |
| C | 4890 | 2.5% |
| B | 3769 | 2.0% |
| Other values (7) | 9115 | 4.7% |
product_description_lenght
Real number (ℝ)
| Distinct | 2936 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 782.03096 |
| Minimum | 0 |
|---|---|
| Maximum | 3992 |
| Zeros | 1359 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 138 |
| Q1 | 341 |
| median | 600 |
| Q3 | 986 |
| 95-th percentile | 2118 |
| Maximum | 3992 |
| Range | 3992 |
| Interquartile range (IQR) | 645 |
Descriptive statistics
| Standard deviation | 655.73156 |
|---|---|
| Coefficient of variation (CV) | 0.8384982 |
| Kurtosis | 4.8019174 |
| Mean | 782.03096 |
| Median Absolute Deviation (MAD) | 300 |
| Skewness | 1.9728338 |
| Sum | 75239981 |
| Variance | 429983.87 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1359 | 1.4% |
| 1893 | 570 | 0.6% |
| 341 | 530 | 0.6% |
| 492 | 523 | 0.5% |
| 903 | 471 | 0.5% |
| 245 | 464 | 0.5% |
| 348 | 456 | 0.5% |
| 236 | 419 | 0.4% |
| 366 | 392 | 0.4% |
| 575 | 353 | 0.4% |
| Other values (2926) | 90674 | 94.2% |
| Value | Count | Frequency (%) |
| 0 | 1359 | 1.4% |
| 4 | 6 | < 0.1% |
| 8 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 20 | 6 | < 0.1% |
| 26 | 2 | < 0.1% |
| 27 | 3 | < 0.1% |
| 28 | 2 | < 0.1% |
| 30 | 6 | < 0.1% |
| 31 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3992 | 1 | < 0.1% |
| 3988 | 1 | < 0.1% |
| 3985 | 3 | < 0.1% |
| 3976 | 2 | < 0.1% |
| 3963 | 1 | < 0.1% |
| 3956 | 1 | < 0.1% |
| 3954 | 2 | < 0.1% |
| 3950 | 1 | < 0.1% |
| 3948 | 1 | < 0.1% |
| 3947 | 6 | < 0.1% |
product_photos_qty
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2185509 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 1359 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.7529091 |
|---|---|
| Coefficient of variation (CV) | 0.79011444 |
| Kurtosis | 4.3790034 |
| Mean | 2.2185509 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.819054 |
| Sum | 213449 |
| Variance | 3.0726903 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 46882 | 48.7% |
| 2 | 18649 | 19.4% |
| 3 | 10947 | 11.4% |
| 4 | 7392 | 7.7% |
| 5 | 4874 | 5.1% |
| 6 | 3319 | 3.4% |
| 7 | 1371 | 1.4% |
| 0 | 1359 | 1.4% |
| 8 | 665 | 0.7% |
| 10 | 314 | 0.3% |
| Other values (10) | 439 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 1359 | 1.4% |
| 1 | 46882 | 48.7% |
| 2 | 18649 | 19.4% |
| 3 | 10947 | 11.4% |
| 4 | 7392 | 7.7% |
| 5 | 4874 | 5.1% |
| 6 | 3319 | 3.4% |
| 7 | 1371 | 1.4% |
| 8 | 665 | 0.7% |
| 9 | 280 | 0.3% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 18 | 4 | < 0.1% |
| 17 | 8 | < 0.1% |
| 15 | 11 | < 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 26 | < 0.1% |
| 12 | 43 | < 0.1% |
| 11 | 59 | 0.1% |
| 10 | 314 | 0.3% |
product_weight_g
Real number (ℝ)
| Distinct | 2152 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2095.437 |
| Minimum | -1 |
|---|---|
| Maximum | 40425 |
| Zeros | 6 |
| Zeros (%) | < 0.1% |
| Negative | 16 |
| Negative (%) | < 0.1% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 125 |
| Q1 | 300 |
| median | 700 |
| Q3 | 1800 |
| 95-th percentile | 9750 |
| Maximum | 40425 |
| Range | 40426 |
| Interquartile range (IQR) | 1500 |
Descriptive statistics
| Standard deviation | 3750.9211 |
|---|---|
| Coefficient of variation (CV) | 1.7900424 |
| Kurtosis | 16.413126 |
| Mean | 2095.437 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | 3.6095692 |
| Sum | 2.0160409 × 10⁸ |
| Variance | 14069409 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200 | 5795 | 6.0% |
| 150 | 4557 | 4.7% |
| 250 | 3927 | 4.1% |
| 300 | 3627 | 3.8% |
| 400 | 3104 | 3.2% |
| 100 | 3042 | 3.2% |
| 350 | 2778 | 2.9% |
| 500 | 2285 | 2.4% |
| 600 | 2226 | 2.3% |
| 700 | 1703 | 1.8% |
| Other values (2142) | 63167 | 65.7% |
| Value | Count | Frequency (%) |
| -1 | 16 | < 0.1% |
| 0 | 6 | < 0.1% |
| 2 | 5 | < 0.1% |
| 25 | 3 | < 0.1% |
| 50 | 785 | 0.8% |
| 53 | 2 | < 0.1% |
| 54 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 58 | 1 | < 0.1% |
| 60 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 40425 | 3 | < 0.1% |
| 30000 | 243 | 0.3% |
| 29800 | 1 | < 0.1% |
| 29700 | 2 | < 0.1% |
| 29600 | 5 | < 0.1% |
| 29500 | 1 | < 0.1% |
| 29250 | 1 | < 0.1% |
| 29150 | 1 | < 0.1% |
| 29100 | 1 | < 0.1% |
| 29050 | 4 | < 0.1% |
product_length_cm
Real number (ℝ)
| Distinct | 100 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.075896 |
| Minimum | -1 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 16 |
| Negative (%) | < 0.1% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 18 |
| median | 25 |
| Q3 | 38 |
| 95-th percentile | 61 |
| Maximum | 105 |
| Range | 106 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 16.096074 |
|---|---|
| Coefficient of variation (CV) | 0.53518188 |
| Kurtosis | 3.7716934 |
| Mean | 30.075896 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 1.7650211 |
| Sum | 2893632 |
| Variance | 259.08361 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 14918 | 15.5% |
| 20 | 8897 | 9.2% |
| 30 | 6181 | 6.4% |
| 17 | 5219 | 5.4% |
| 18 | 5011 | 5.2% |
| 19 | 4046 | 4.2% |
| 25 | 4042 | 4.2% |
| 40 | 3463 | 3.6% |
| 22 | 3348 | 3.5% |
| 35 | 2514 | 2.6% |
| Other values (90) | 38572 | 40.1% |
| Value | Count | Frequency (%) |
| -1 | 16 | < 0.1% |
| 7 | 30 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 4 | < 0.1% |
| 10 | 7 | < 0.1% |
| 11 | 82 | 0.1% |
| 12 | 34 | < 0.1% |
| 13 | 47 | < 0.1% |
| 14 | 117 | 0.1% |
| 15 | 175 | 0.2% |
| Value | Count | Frequency (%) |
| 105 | 281 | 0.3% |
| 104 | 29 | < 0.1% |
| 103 | 34 | < 0.1% |
| 102 | 42 | < 0.1% |
| 101 | 87 | 0.1% |
| 100 | 297 | 0.3% |
| 99 | 31 | < 0.1% |
| 98 | 41 | < 0.1% |
| 97 | 10 | < 0.1% |
| 96 | 8 | < 0.1% |
product_height_cm
Real number (ℝ)
| Distinct | 103 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.42888 |
| Minimum | -1 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 16 |
| Negative (%) | < 0.1% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8 |
| median | 13 |
| Q3 | 20 |
| 95-th percentile | 44 |
| Maximum | 105 |
| Range | 106 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 13.26808 |
|---|---|
| Coefficient of variation (CV) | 0.80760709 |
| Kurtosis | 7.4824695 |
| Mean | 16.42888 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 2.2585977 |
| Sum | 1580639 |
| Variance | 176.04195 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 8323 | 8.7% |
| 20 | 5693 | 5.9% |
| 12 | 5533 | 5.8% |
| 15 | 5527 | 5.7% |
| 11 | 5349 | 5.6% |
| 2 | 4336 | 4.5% |
| 4 | 4136 | 4.3% |
| 8 | 3964 | 4.1% |
| 16 | 3881 | 4.0% |
| 5 | 3823 | 4.0% |
| Other values (93) | 45646 | 47.4% |
| Value | Count | Frequency (%) |
| -1 | 16 | < 0.1% |
| 2 | 4336 | 4.5% |
| 3 | 2297 | 2.4% |
| 4 | 4136 | 4.3% |
| 5 | 3823 | 4.0% |
| 6 | 2966 | 3.1% |
| 7 | 3642 | 3.8% |
| 8 | 3964 | 4.1% |
| 9 | 2755 | 2.9% |
| 10 | 8323 | 8.7% |
| Value | Count | Frequency (%) |
| 105 | 105 | 0.1% |
| 104 | 10 | < 0.1% |
| 103 | 37 | < 0.1% |
| 102 | 7 | < 0.1% |
| 100 | 36 | < 0.1% |
| 99 | 5 | < 0.1% |
| 98 | 3 | < 0.1% |
| 97 | 1 | < 0.1% |
| 96 | 5 | < 0.1% |
| 95 | 21 | < 0.1% |
product_width_cm
Real number (ℝ)
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.002661 |
| Minimum | -1 |
|---|---|
| Maximum | 118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 16 |
| Negative (%) | < 0.1% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 15 |
| median | 20 |
| Q3 | 30 |
| 95-th percentile | 45 |
| Maximum | 118 |
| Range | 119 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.730453 |
|---|---|
| Coefficient of variation (CV) | 0.5099607 |
| Kurtosis | 4.5873749 |
| Mean | 23.002661 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.7135749 |
| Sum | 2213109 |
| Variance | 137.60353 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 10209 | 10.6% |
| 11 | 8974 | 9.3% |
| 15 | 7715 | 8.0% |
| 16 | 7188 | 7.5% |
| 30 | 6297 | 6.5% |
| 12 | 4740 | 4.9% |
| 13 | 4592 | 4.8% |
| 14 | 3981 | 4.1% |
| 18 | 3481 | 3.6% |
| 40 | 3296 | 3.4% |
| Other values (84) | 35738 | 37.1% |
| Value | Count | Frequency (%) |
| -1 | 16 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 16 | < 0.1% |
| 9 | 46 | < 0.1% |
| 10 | 66 | 0.1% |
| 11 | 8974 | 9.3% |
| 12 | 4740 | 4.9% |
| 13 | 4592 | 4.8% |
| 14 | 3981 | 4.1% |
| Value | Count | Frequency (%) |
| 118 | 7 | < 0.1% |
| 105 | 13 | < 0.1% |
| 104 | 1 | < 0.1% |
| 102 | 2 | < 0.1% |
| 101 | 2 | < 0.1% |
| 100 | 40 | < 0.1% |
| 98 | 1 | < 0.1% |
| 97 | 1 | < 0.1% |
| 95 | 2 | < 0.1% |
| 93 | 12 | < 0.1% |
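
The four physical-measurement columns above (`product_weight_g`, `product_length_cm`, `product_height_cm`, `product_width_cm`) each report a minimum of -1 and exactly 16 negative values, which suggests a shared placeholder rather than real measurements. The report does not say what -1 stands for, so the pandas sketch below treats it as "missing" purely as an assumption. The miniature DataFrame is hypothetical and only mimics the column names.

```python
import pandas as pd

# Hypothetical miniature of the four measurement columns (values are illustrative).
df = pd.DataFrame({
    "product_weight_g": [500.0, -1.0, 0.0, 7150.0],
    "product_length_cm": [19.0, -1.0, 24.0, 65.0],
    "product_height_cm": [8.0, -1.0, 19.0, 10.0],
    "product_width_cm": [13.0, -1.0, 21.0, 65.0],
})

measure_cols = [
    "product_weight_g",
    "product_length_cm",
    "product_height_cm",
    "product_width_cm",
]

# Assumption: -1 marks an unknown measurement. mask() replaces every
# negative entry with NaN and leaves zeros and positive values untouched.
cleaned = df.copy()
cleaned[measure_cols] = cleaned[measure_cols].mask(cleaned[measure_cols] < 0)

# Number of rows where at least one measurement was a placeholder.
n_flagged = int(cleaned[measure_cols].isna().any(axis=1).sum())
print(n_flagged)
```

Zero weights (6 rows in the report) are deliberately left alone here; whether they are also placeholders is a separate judgment call.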
product_category_name_english
Categorical
| Distinct | 72 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| bed_bath_table | 9153 |
|---|---|
| health_beauty | 8578 |
| sports_leisure | 7474 |
| computers_accessories | 6489 |
| furniture_decor | 6169 |
| Other values (67) | 58348 |
Length
| Max length | 39 |
|---|---|
| Median length | 31 |
| Mean length | 12.773207 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1228923 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | housewares |
|---|---|
| 2nd row | baby |
| 3rd row | perfumery |
| 4th row | auto |
| 5th row | pet_shop |
Common Values
| Value | Count | Frequency (%) |
| bed_bath_table | 9153 | 9.5% |
| health_beauty | 8578 | 8.9% |
| sports_leisure | 7474 | 7.8% |
| computers_accessories | 6489 | 6.7% |
| furniture_decor | 6169 | 6.4% |
| housewares | 5670 | 5.9% |
| watches_gifts | 5474 | 5.7% |
| telephony | 4076 | 4.2% |
| auto | 3783 | 3.9% |
| toys | 3747 | 3.9% |
| Other values (62) | 35598 | 37.0% |
Length
| Value | Count | Frequency (%) |
| bed_bath_table | 9153 | 9.5% |
| health_beauty | 8578 | 8.9% |
| sports_leisure | 7474 | 7.8% |
| computers_accessories | 6489 | 6.7% |
| furniture_decor | 6169 | 6.4% |
| housewares | 5670 | 5.9% |
| watches_gifts | 5474 | 5.7% |
| telephony | 4076 | 4.2% |
| auto | 3783 | 3.9% |
| toys | 3747 | 3.9% |
| Other values (62) | 35598 | 37.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 148905 | 12.1% |
| s | 116596 | 9.5% |
| t | 108382 | 8.8% |
| o | 91321 | 7.4% |
| a | 83611 | 6.8% |
| r | 82212 | 6.7% |
| _ | 81487 | 6.6% |
| u | 63059 | 5.1% |
| c | 58345 | 4.7% |
| i | 50672 | 4.1% |
| Other values (15) | 344333 | 28.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1147187 | 93.3% |
| Connector Punctuation | 81487 | 6.6% |
| Decimal Number | 249 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 148905 | 13.0% |
| s | 116596 | 10.2% |
| t | 108382 | 9.4% |
| o | 91321 | 8.0% |
| a | 83611 | 7.3% |
| r | 82212 | 7.2% |
| u | 63059 | 5.5% |
| c | 58345 | 5.1% |
| i | 50672 | 4.4% |
| h | 49140 | 4.3% |
| Other values (13) | 294944 | 25.7% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 81487 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 249 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1147187 | 93.3% |
| Common | 81736 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 148905 | 13.0% |
| s | 116596 | 10.2% |
| t | 108382 | 9.4% |
| o | 91321 | 8.0% |
| a | 83611 | 7.3% |
| r | 82212 | 7.2% |
| u | 63059 | 5.5% |
| c | 58345 | 5.1% |
| i | 50672 | 4.4% |
| h | 49140 | 4.3% |
| Other values (13) | 294944 | 25.7% |
Common
| Value | Count | Frequency (%) |
| _ | 81487 | 99.7% |
| 2 | 249 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1228923 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 148905 | 12.1% |
| s | 116596 | 9.5% |
| t | 108382 | 8.8% |
| o | 91321 | 7.4% |
| a | 83611 | 6.8% |
| r | 82212 | 6.7% |
| _ | 81487 | 6.6% |
| u | 63059 | 5.1% |
| c | 58345 | 4.7% |
| i | 50672 | 4.1% |
| Other values (15) | 344333 | 28.0% |
order_purchase_time
Categorical
| Distinct | 602 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 11/24/2017 | 1147 |
|---|---|
| 11/25/2017 | 487 |
| 11/27/2017 | 395 |
| 11/26/2017 | 382 |
| 11/28/2017 | 372 |
| Other values (597) | 93428 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 962110 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10/02/2017 |
|---|---|
| 2nd row | 09/04/2017 |
| 3rd row | 07/24/2018 |
| 4th row | 08/08/2018 |
| 5th row | 11/18/2017 |
Common Values
| Value | Count | Frequency (%) |
| 11/24/2017 | 1147 | 1.2% |
| 11/25/2017 | 487 | 0.5% |
| 11/27/2017 | 395 | 0.4% |
| 11/26/2017 | 382 | 0.4% |
| 11/28/2017 | 372 | 0.4% |
| 05/07/2018 | 363 | 0.4% |
| 08/06/2018 | 363 | 0.4% |
| 05/14/2018 | 355 | 0.4% |
| 08/07/2018 | 353 | 0.4% |
| 05/16/2018 | 351 | 0.4% |
| Other values (592) | 91643 | 95.3% |
Length
| Value | Count | Frequency (%) |
| 11/24/2017 | 1147 | 1.2% |
| 11/25/2017 | 487 | 0.5% |
| 11/27/2017 | 395 | 0.4% |
| 11/26/2017 | 382 | 0.4% |
| 11/28/2017 | 372 | 0.4% |
| 05/07/2018 | 363 | 0.4% |
| 08/06/2018 | 363 | 0.4% |
| 05/14/2018 | 355 | 0.4% |
| 08/07/2018 | 353 | 0.4% |
| 05/16/2018 | 351 | 0.4% |
| Other values (592) | 91643 | 95.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 217045 | 22.6% |
| / | 192422 | 20.0% |
| 1 | 171900 | 17.9% |
| 2 | 150181 | 15.6% |
| 8 | 72795 | 7.6% |
| 7 | 62839 | 6.5% |
| 3 | 23000 | 2.4% |
| 5 | 20166 | 2.1% |
| 4 | 19492 | 2.0% |
| 6 | 19216 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 769688 | 80.0% |
| Other Punctuation | 192422 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 217045 | 28.2% |
| 1 | 171900 | 22.3% |
| 2 | 150181 | 19.5% |
| 8 | 72795 | 9.5% |
| 7 | 62839 | 8.2% |
| 3 | 23000 | 3.0% |
| 5 | 20166 | 2.6% |
| 4 | 19492 | 2.5% |
| 6 | 19216 | 2.5% |
| 9 | 13054 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 192422 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 962110 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 217045 | 22.6% |
| / | 192422 | 20.0% |
| 1 | 171900 | 17.9% |
| 2 | 150181 | 15.6% |
| 8 | 72795 | 7.6% |
| 7 | 62839 | 6.5% |
| 3 | 23000 | 2.4% |
| 5 | 20166 | 2.1% |
| 4 | 19492 | 2.0% |
| 6 | 19216 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 962110 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 217045 | 22.6% |
| / | 192422 | 20.0% |
| 1 | 171900 | 17.9% |
| 2 | 150181 | 15.6% |
| 8 | 72795 | 7.6% |
| 7 | 62839 | 6.5% |
| 3 | 23000 | 2.4% |
| 5 | 20166 | 2.1% |
| 4 | 19492 | 2.0% |
| 6 | 19216 | 2.0% |
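
`order_purchase_time` is profiled as Categorical: 602 distinct 10-character strings made of digits and `/`. If it needs to behave as a date, it can be parsed explicitly. The sketch below assumes a month/day/year layout, which matches the sample rows (for example `order_purchase_timestamp` 2017-10-02 next to `order_purchase_time` 10/02/2017, and the value 07/24/2018, which cannot be day/month). The format string is an inference from those samples, not something the report states.

```python
import pandas as pd

# Values copied from the Sample table of order_purchase_time.
dates = pd.Series(
    ["10/02/2017", "09/04/2017", "07/24/2018", "08/08/2018", "11/18/2017"]
)

# Assumption: month/day/year. An explicit format avoids silent
# day/month swaps on ambiguous strings such as 09/04/2017.
parsed = pd.to_datetime(dates, format="%m/%d/%Y")

print(parsed.dt.strftime("%Y-%m-%d").tolist())
```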
recence
Real number (ℝ)
| Distinct | 602 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 292.27314 |
| Minimum | 54 |
|---|---|
| Maximum | 655 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 54 |
|---|---|
| 5-th percentile | 77 |
| Q1 | 169 |
| median | 276 |
| Q3 | 401 |
| 95-th percentile | 570 |
| Maximum | 655 |
| Range | 601 |
| Interquartile range (IQR) | 232 |
Descriptive statistics
| Standard deviation | 150.90353 |
|---|---|
| Coefficient of variation (CV) | 0.51630995 |
| Kurtosis | -0.78139219 |
| Mean | 292.27314 |
| Median Absolute Deviation (MAD) | 113 |
| Skewness | 0.38583269 |
| Sum | 28119891 |
| Variance | 22771.876 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 332 | 1167 | 1.2% |
| 331 | 501 | 0.5% |
| 329 | 411 | 0.4% |
| 330 | 392 | 0.4% |
| 328 | 388 | 0.4% |
| 77 | 365 | 0.4% |
| 168 | 362 | 0.4% |
| 76 | 356 | 0.4% |
| 161 | 351 | 0.4% |
| 159 | 351 | 0.4% |
| Other values (592) | 91567 | 95.2% |
| Value | Count | Frequency (%) |
| 54 | 11 | < 0.1% |
| 55 | 43 | < 0.1% |
| 56 | 70 | 0.1% |
| 57 | 75 | 0.1% |
| 58 | 71 | 0.1% |
| 59 | 99 | 0.1% |
| 60 | 146 | 0.2% |
| 61 | 185 | 0.2% |
| 62 | 245 | 0.3% |
| 63 | 257 | 0.3% |
| Value | Count | Frequency (%) |
| 655 | 32 | < 0.1% |
| 654 | 4 | < 0.1% |
| 653 | 4 | < 0.1% |
| 652 | 4 | < 0.1% |
| 651 | 5 | < 0.1% |
| 650 | 6 | < 0.1% |
| 649 | 9 | < 0.1% |
| 648 | 12 | < 0.1% |
| 647 | 10 | < 0.1% |
| 646 | 16 | < 0.1% |
recence_score
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 845.8 KiB |
| 3 | 19431 |
|---|---|
| 5 | 19325 |
| 4 | 19231 |
| 1 | 19179 |
| 2 | 19045 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 96211 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 5 |
| 4th row | 5 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 19431 | 20.2% |
| 5 | 19325 | 20.1% |
| 4 | 19231 | 20.0% |
| 1 | 19179 | 19.9% |
| 2 | 19045 | 19.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 19431 | 20.2% |
| 5 | 19325 | 20.1% |
| 4 | 19231 | 20.0% |
| 1 | 19179 | 19.9% |
| 2 | 19045 | 19.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 19431 | 20.2% |
| 5 | 19325 | 20.1% |
| 4 | 19231 | 20.0% |
| 1 | 19179 | 19.9% |
| 2 | 19045 | 19.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 96211 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 19431 | 20.2% |
| 5 | 19325 | 20.1% |
| 4 | 19231 | 20.0% |
| 1 | 19179 | 19.9% |
| 2 | 19045 | 19.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 96211 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 19431 | 20.2% |
| 5 | 19325 | 20.1% |
| 4 | 19231 | 20.0% |
| 1 | 19179 | 19.9% |
| 2 | 19045 | 19.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 96211 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 19431 | 20.2% |
| 5 | 19325 | 20.1% |
| 4 | 19231 | 20.0% |
| 1 | 19179 | 19.9% |
| 2 | 19045 | 19.8% |
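
The five `recence_score` values each cover roughly a fifth of the 96211 rows, and the sample rows pair small `recence` values (75, 90) with score 5 and large ones (524, 637) with score 1. That pattern is consistent with an equal-frequency (quintile) binning of `recence`, but the report does not state how the score was built. The sketch below shows one way such a score could be derived with `pandas.qcut`; the input series is a hypothetical stand-in that only spans the reported range 54 to 655.

```python
import pandas as pd

# Hypothetical stand-in: one value per integer in the reported range 54..655.
recence = pd.Series(range(54, 656), dtype="float64")

# Assumption: five equal-frequency bins, with the smallest recence
# (the most recent purchase) receiving the highest score.
recence_score = pd.qcut(recence, q=5, labels=[5, 4, 3, 2, 1])

# Bin sizes should be close to equal, as in the report's Common Values table.
print(recence_score.value_counts().sort_index())
```

On real data with many tied values the bin edges may coincide; `qcut` accepts `duplicates="drop"` for that case, at the cost of fewer bins.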
| | review_score | length_comment_title | length_comment_message | payment_installments | payment_value | nb_items | sum_price | sum_freight_value | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | recence | payment_type | customer_state | product_category_name_english | recence_score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| review_score | 1.000 | -0.018 | -0.223 | -0.022 | -0.040 | -0.108 | -0.030 | -0.089 | 0.016 | 0.013 | -0.013 | -0.015 | -0.003 | -0.009 | -0.020 | 0.018 | 0.044 | 0.043 | 0.043 |
| length_comment_title | -0.018 | 1.000 | 0.321 | 0.005 | 0.032 | 0.026 | 0.027 | 0.054 | 0.031 | 0.006 | -0.011 | -0.031 | -0.003 | -0.019 | -0.420 | 0.017 | 0.013 | 0.034 | 0.226 |
| length_comment_message | -0.223 | 0.321 | 1.000 | 0.044 | 0.064 | 0.083 | 0.058 | 0.069 | -0.008 | -0.005 | 0.035 | 0.012 | 0.019 | 0.013 | 0.016 | 0.000 | 0.021 | 0.020 | 0.014 |
| payment_installments | -0.022 | 0.005 | 0.044 | 1.000 | 0.382 | 0.058 | 0.375 | 0.231 | 0.037 | 0.002 | 0.220 | 0.118 | 0.121 | 0.137 | 0.045 | 0.199 | 0.034 | 0.090 | 0.040 |
| payment_value | -0.040 | 0.032 | 0.064 | 0.382 | 1.000 | 0.222 | 0.990 | 0.567 | 0.193 | 0.006 | 0.520 | 0.268 | 0.347 | 0.275 | -0.014 | 0.008 | 0.015 | 0.099 | 0.007 |
| nb_items | -0.108 | 0.026 | 0.083 | 0.058 | 0.222 | 1.000 | 0.178 | 0.378 | -0.036 | -0.056 | -0.004 | 0.008 | 0.004 | 0.001 | 0.003 | 0.008 | 0.000 | 0.027 | 0.008 |
| sum_price | -0.030 | 0.027 | 0.058 | 0.375 | 0.990 | 0.178 | 1.000 | 0.469 | 0.197 | 0.011 | 0.507 | 0.256 | 0.339 | 0.265 | -0.010 | 0.008 | 0.013 | 0.093 | 0.007 |
| sum_freight_value | -0.089 | 0.054 | 0.069 | 0.231 | 0.567 | 0.378 | 0.469 | 1.000 | 0.101 | -0.010 | 0.419 | 0.272 | 0.272 | 0.262 | -0.044 | 0.000 | 0.030 | 0.054 | 0.007 |
| product_description_lenght | 0.016 | 0.031 | -0.008 | 0.037 | 0.193 | -0.036 | 0.197 | 0.101 | 1.000 | 0.154 | 0.100 | -0.010 | 0.132 | -0.060 | -0.065 | 0.012 | 0.019 | 0.214 | 0.044 |
| product_photos_qty | 0.013 | 0.006 | -0.005 | 0.002 | 0.006 | -0.056 | 0.011 | -0.010 | 0.154 | 1.000 | 0.015 | 0.009 | -0.067 | -0.004 | 0.001 | 0.000 | 0.012 | 0.151 | 0.027 |
| product_weight_g | -0.013 | -0.011 | 0.035 | 0.220 | 0.520 | -0.004 | 0.507 | 0.419 | 0.100 | 0.015 | 1.000 | 0.622 | 0.536 | 0.624 | 0.053 | 0.011 | 0.012 | 0.193 | 0.023 |
| product_length_cm | -0.015 | -0.031 | 0.012 | 0.118 | 0.268 | 0.008 | 0.256 | 0.272 | -0.010 | 0.009 | 0.622 | 1.000 | 0.260 | 0.640 | 0.075 | 0.010 | 0.008 | 0.257 | 0.046 |
| product_height_cm | -0.003 | -0.003 | 0.019 | 0.121 | 0.347 | 0.004 | 0.339 | 0.272 | 0.132 | -0.067 | 0.536 | 0.260 | 1.000 | 0.345 | 0.015 | 0.015 | 0.013 | 0.277 | 0.041 |
| product_width_cm | -0.009 | -0.019 | 0.013 | 0.137 | 0.275 | 0.001 | 0.265 | 0.262 | -0.060 | -0.004 | 0.624 | 0.640 | 0.345 | 1.000 | 0.057 | 0.012 | 0.012 | 0.279 | 0.040 |
| recence | -0.020 | -0.420 | 0.016 | 0.045 | -0.014 | 0.003 | -0.010 | -0.044 | -0.065 | 0.001 | 0.053 | 0.075 | 0.015 | 0.057 | 1.000 | 0.040 | 0.027 | 0.097 | 0.882 |
| payment_type | 0.018 | 0.017 | 0.000 | 0.199 | 0.008 | 0.008 | 0.008 | 0.000 | 0.012 | 0.000 | 0.011 | 0.010 | 0.015 | 0.012 | 0.040 | 1.000 | 0.030 | 0.036 | 0.046 |
| customer_state | 0.044 | 0.013 | 0.021 | 0.034 | 0.015 | 0.000 | 0.013 | 0.030 | 0.019 | 0.012 | 0.012 | 0.008 | 0.013 | 0.012 | 0.027 | 0.030 | 1.000 | 0.030 | 0.035 |
| product_category_name_english | 0.043 | 0.034 | 0.020 | 0.090 | 0.099 | 0.027 | 0.093 | 0.054 | 0.214 | 0.151 | 0.193 | 0.257 | 0.277 | 0.279 | 0.097 | 0.036 | 0.030 | 1.000 | 0.128 |
| recence_score | 0.043 | 0.226 | 0.014 | 0.040 | 0.007 | 0.008 | 0.007 | 0.007 | 0.044 | 0.027 | 0.023 | 0.046 | 0.041 | 0.040 | 0.882 | 0.046 | 0.035 | 0.128 | 1.000 |
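
The matrix above is easier to scan when reduced to the pairs that exceed a cutoff. The sketch below does this for a four-column excerpt whose coefficients are copied from the matrix. The 0.5 cutoff is an assumption: it happens to be consistent with the "High correlation" alerts in the report overview, but the report does not print its threshold.

```python
import pandas as pd

# Excerpt of the matrix above; coefficients copied verbatim, other columns omitted.
cols = ["payment_value", "sum_price", "sum_freight_value", "product_weight_g"]
corr = pd.DataFrame(
    [
        [1.000, 0.990, 0.567, 0.520],
        [0.990, 1.000, 0.469, 0.507],
        [0.567, 0.469, 1.000, 0.419],
        [0.520, 0.507, 0.419, 1.000],
    ],
    index=cols,
    columns=cols,
)

THRESHOLD = 0.5  # assumed cutoff, not read from the report's configuration

# Walk the upper triangle only, so each unordered pair is reported once.
pairs = [
    (a, b, float(corr.loc[a, b]))
    for i, a in enumerate(cols)
    for b in cols[i + 1:]
    if abs(corr.loc[a, b]) >= THRESHOLD
]

# Strongest relationships first.
for a, b, value in sorted(pairs, key=lambda p: -abs(p[2])):
    print(f"{a} ~ {b}: {value:.3f}")
```
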
| | customer_id | order_status | order_purchase_timestamp | review_score | length_comment_title | length_comment_message | payment_type | payment_installments | payment_value | nb_items | sum_price | sum_freight_value | customer_unique_id | customer_city | customer_state | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | product_category_name_english | order_purchase_time | recence | recence_score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 9ef432eb6251297304e76186b10a928d | delivered | 2017-10-02 10:56:33 | 4.0 | 0.0 | 170.0 | credit_card,voucher | 1.0 | 38.71 | 1.0 | 29.99 | 8.72 | 7c396fd4830fd04220f754e42b4e5bff | sao paulo | SP | 268.0 | 4.0 | 500.0 | 19.0 | 8.0 | 13.0 | housewares | 10/02/2017 | 385.0 | 2 |
| 1 | 31f31efcb333fcbad2b1371c8cf0fa84 | delivered | 2017-09-04 11:26:38 | 5.0 | 0.0 | 102.0 | credit_card | 1.0 | 44.11 | 1.0 | 35.39 | 8.72 | 7c396fd4830fd04220f754e42b4e5bff | sao paulo | SP | 2395.0 | 1.0 | 350.0 | 19.0 | 14.0 | 12.0 | baby | 09/04/2017 | 385.0 | 2 |
| 2 | b0830fb4747a6c6d20dea0b8c802d7ef | delivered | 2018-07-24 20:41:37 | 4.0 | 16.0 | 20.0 | boleto | 1.0 | 141.46 | 1.0 | 118.70 | 22.76 | af07308b275d755c9edb36a90c618231 | barreiras | BA | 178.0 | 1.0 | 400.0 | 19.0 | 13.0 | 19.0 | perfumery | 07/24/2018 | 90.0 | 5 |
| 3 | 41ce2a54c0b03bf3443c3d931a367089 | delivered | 2018-08-08 08:38:49 | 5.0 | 0.0 | 0.0 | credit_card | 3.0 | 179.12 | 1.0 | 159.90 | 19.22 | 3a653a41f6f9fc3d2a113cf8398680e8 | vianopolis | GO | 232.0 | 1.0 | 420.0 | 24.0 | 19.0 | 21.0 | auto | 08/08/2018 | 75.0 | 5 |
| 4 | f88197465ea7920adcdbec7375364d82 | delivered | 2017-11-18 19:28:06 | 5.0 | 0.0 | 105.0 | credit_card | 1.0 | 72.20 | 1.0 | 45.00 | 27.20 | 7c142cf63193a1473d2e66489a9ae977 | sao goncalo do amarante | RN | 468.0 | 3.0 | 450.0 | 30.0 | 10.0 | 20.0 | pet_shop | 11/18/2017 | 338.0 | 2 |
| 5 | 8ab97904e6daea8866dbdbc4fb7aad2c | delivered | 2018-02-13 21:18:39 | 5.0 | 0.0 | 0.0 | credit_card | 1.0 | 28.62 | 1.0 | 19.90 | 8.72 | 72632f0f9dd73dfee390c9b22eb56dd6 | santo andre | SP | 316.0 | 4.0 | 250.0 | 51.0 | 15.0 | 15.0 | stationery | 02/13/2018 | 251.0 | 3 |
| 6 | 503740e9ca751ccdda7ba28e9ab8f608 | delivered | 2017-07-09 21:57:05 | 4.0 | 0.0 | 0.0 | credit_card | 6.0 | 175.26 | 1.0 | 147.90 | 27.36 | 80bb27c7c16e8f973207a5086ab329e2 | congonhinhas | PR | 608.0 | 1.0 | 7150.0 | 65.0 | 10.0 | 65.0 | auto | 07/09/2017 | 470.0 | 1 |
| 7 | 9bdf08b4b3b52b5526ff42d37d47f222 | delivered | 2017-05-16 13:10:30 | 5.0 | 0.0 | 0.0 | credit_card | 3.0 | 75.16 | 1.0 | 59.99 | 15.17 | 932afa1e708222e5821dac9cd5db4cae | nilopolis | RJ | 956.0 | 1.0 | 50.0 | 16.0 | 16.0 | 17.0 | auto | 05/16/2017 | 524.0 | 1 |
| 8 | f54a9f0e6b351c431402b8461ea51999 | delivered | 2017-01-23 18:29:09 | 1.0 | 0.0 | 0.0 | boleto | 1.0 | 35.95 | 1.0 | 19.90 | 16.05 | 39382392765b6dc74812866ee5ee92a7 | faxinalzinho | RS | 432.0 | 2.0 | 300.0 | 35.0 | 35.0 | 15.0 | furniture_decor | 01/23/2017 | 637.0 | 1 |
| 9 | 31ad1d1b63eb9962463f764d4e6e0c9d | delivered | 2017-07-29 11:55:02 | 5.0 | 0.0 | 0.0 | credit_card,voucher | 1.0 | 169.76 | 1.0 | 149.99 | 19.77 | 299905e3934e9e181bfb2e164dd4b4f8 | sorocaba | SP | 527.0 | 1.0 | 9750.0 | 42.0 | 41.0 | 42.0 | office_furniture | 07/29/2017 | 450.0 | 1 |
| | customer_id | order_status | order_purchase_timestamp | review_score | length_comment_title | length_comment_message | payment_type | payment_installments | payment_value | nb_items | sum_price | sum_freight_value | customer_unique_id | customer_city | customer_state | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | product_category_name_english | order_purchase_time | recence | recence_score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 96201 | 8e1ec396e317ff4c82a03ce16a0c3eb3 | delivered | 2017-10-27 15:21:00 | 5.0 | 0.0 | 77.0 | credit_card | 3.0 | 164.30 | 1.0 | 142.50 | 21.80 | 1a3b8f1d0782ebedbcf220a96cbc1655 | maceio | AL | 178.0 | 1.0 | 400.0 | 19.0 | 13.0 | 19.0 | perfumery | 10/27/2017 | 360.0 | 2 |
| 96202 | a2f7428f0cafbc8e59f20e1444b67315 | delivered | 2017-12-20 09:52:41 | 1.0 | 0.0 | 86.0 | credit_card | 1.0 | 71.04 | 1.0 | 55.90 | 15.14 | a49e8e11e850592fe685ae3c64b40eca | campo do tenente | PR | 372.0 | 2.0 | 300.0 | 16.0 | 6.0 | 12.0 | musical_instruments | 12/20/2017 | 306.0 | 3 |
| 96203 | da2124f134f5dfbce9d06f29bdb6c308 | delivered | 2017-10-04 19:57:37 | 5.0 | 0.0 | 0.0 | credit_card,voucher | 2.0 | 106.79 | 2.0 | 69.01 | 37.78 | c716cf2b5b86fb24257cffe9e7969df8 | cuiaba | MT | 180.0 | 3.0 | 750.0 | 26.0 | 15.0 | 26.0 | toys | 10/04/2017 | 383.0 | 2 |
| 96204 | f01a6bfcc730456317e4081fe0c9940e | delivered | 2017-01-27 00:30:03 | 5.0 | 0.0 | 0.0 | credit_card,voucher | 5.0 | 389.43 | 1.0 | 370.00 | 19.43 | e03dbdf5e56c96b106d8115ac336f47f | divinopolis | MG | 657.0 | 1.0 | 750.0 | 38.0 | 12.0 | 25.0 | health_beauty | 01/27/2017 | 633.0 | 1 |
| 96205 | 47cd45a6ac7b9fb16537df2ccffeb5ac | delivered | 2017-02-23 09:05:12 | 5.0 | 0.0 | 0.0 | credit_card | 3.0 | 155.99 | 1.0 | 139.90 | 16.09 | 831ce3f1bacbd424fc4e38fbd4d66d29 | sao paulo | SP | 254.0 | 2.0 | 2500.0 | 49.0 | 13.0 | 41.0 | furniture_decor | 02/23/2017 | 606.0 | 1 |
| 96206 | 39bd1228ee8140590ac3aca26f2dfe00 | delivered | 2017-03-09 09:54:05 | 5.0 | 0.0 | 0.0 | credit_card | 3.0 | 85.08 | 1.0 | 72.00 | 13.08 | 6359f309b166b0196dbf7ad2ac62bb5a | sao jose dos campos | SP | 1517.0 | 1.0 | 1175.0 | 22.0 | 13.0 | 18.0 | health_beauty | 03/09/2017 | 592.0 | 1 |
| 96207 | 1fca14ff2861355f6e5f14306ff977a7 | delivered | 2018-02-06 12:58:58 | 4.0 | 0.0 | 44.0 | credit_card | 3.0 | 195.00 | 1.0 | 174.90 | 20.10 | da62f9e57a76d978d02ab5362c509660 | praia grande | SP | 828.0 | 4.0 | 4950.0 | 40.0 | 10.0 | 40.0 | baby | 02/06/2018 | 258.0 | 3 |
| 96208 | 1aa71eb042121263aafbe80c1b562c9c | delivered | 2017-08-27 14:46:43 | 5.0 | 0.0 | 28.0 | credit_card | 5.0 | 271.01 | 1.0 | 205.99 | 65.02 | 737520a9aad80b3fbbdad19b66b37b30 | nova vicosa | BA | 500.0 | 2.0 | 13300.0 | 32.0 | 90.0 | 22.0 | home_appliances_2 | 08/27/2017 | 421.0 | 2 |
| 96209 | b331b74b18dc79bcdf6532d51e1637c1 | delivered | 2018-01-08 21:28:27 | 2.0 | 0.0 | 53.0 | credit_card | 4.0 | 441.16 | 2.0 | 359.98 | 81.18 | 5097a5312c8b157bb7be58ae360ef43c | japuiba | RJ | 1893.0 | 1.0 | 6550.0 | 20.0 | 20.0 | 20.0 | computers_accessories | 01/08/2018 | 287.0 | 3 |
| 96210 | edb027a75a1449115f6b43211ae02a24 | delivered | 2018-03-08 20:57:30 | 5.0 | 0.0 | 0.0 | debit_card | 1.0 | 86.86 | 1.0 | 68.50 | 18.36 | 60350aa974b26ff12caad89e55993bd6 | lapa | PR | 569.0 | 1.0 | 150.0 | 16.0 | 7.0 | 15.0 | health_beauty | 03/08/2018 | 228.0 | 4 |